Effect of Data Compression on Pattern Matching in Historical Data

نویسندگان

  • Ashish Singhal
  • Dale E. Seborg
چکیده

It is a common practice in the process industry to compress process data before it is archived. However, compression may alter the original data in a manner that makes extracting useful information from it more difficult. In this paper, popular data compression methods and their effect on pattern matching in historical data are evaluated. Pattern matching is performed using principal-component analysis based similarity factors. Simulation results indicate that waveletbased compression provides the best compression for pattern matching, while compression using OSI PI software produces the best reconstruction of data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data Compression Issues with Pattern Matching in Historical Data

It is a common practice in the process industries to compress plant data before it is archived. However, compression may alter the data in a manner that makes it difficult to extract useful information from it. In this paper we evaluate the effectiveness of a new pattern matching technique1 for applications involving compressed historical data. We also compare several data compression methods w...

متن کامل

A New Compression Method for Compressed Matching

A practical adaptive compression algorithm based on LZSS is presented, which is especially constructed to solve the compressed pattern matching problem, i.e., pattern matching directly in a compressed text without decompressing.

متن کامل

On The Role of Pattern Matching In

In this paper, the role of pattern matching information theory is motivated and discussed. We describe the relationship between a pattern's recurrence time and its probability under the data generating stochastic source. We motivate how this relationship has led to great advances in universal data-compression. We then describe non-asymptotic uniform bounds on the performance of data compression...

متن کامل

Pattern Matching in Compressed Texts and Images

This review provides a survey of techniques for pattern matching in compressed text and images. Normally compressed data needs to be decompressed before it is processed, but if the compression has been done in the right way, it is often possible to search the data without having to decompress it, or at least only partially decompress it. The problem can be divided into lossless and lossy compre...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005